GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot Learning
نویسندگان
چکیده
Generalized Zero-Shot Learning (GZSL) aims to recognize images not only for seen classes but also unseen ones by transferring semantic-visual relationships from the classes. It is an intuitive solution take advantage of generative models hallucinate realistic samples based on knowledge learned However, due generation shifts, synthesized most existing methods may drift real distribution data. To address this issue, we propose a novel flow-based framework that consists multiple conditional affine coupling layers learning data generation. Specifically, investigate and three essential problems trigger i.e., semantic inconsistency, variance collapse, structure disorder. First, improve reflection semantic information in generated samples, proactively embed into transformation each layer. Second, promote intrinsic feature variance classes, introduce boundary sample mining strategy with entropy maximization discover ambiguous visual variants prototypes hereby calibrate decision classifiers. Third, relative positioning proposed revise attribute embeddings, guiding which fully preserve inter-class geometric structure further avoid disorder space. Extensive experimental results four GZSL benchmark datasets demonstrate GSMFlow achieves state-of-the-art performance GZSL.
منابع مشابه
A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning
Prevalent techniques in zero-shot learning do not generalize well to other related problem scenarios. Here, we present a unified approach for conventional zero-shot, generalized zero-shot and few-shot learning problems. Our approach is based on a novel Class Adapting Principal Directions (CAPD) concept that allows multiple embeddings of image features into a semantic space. Given an image, our ...
متن کاملGeneralized Zero-Shot Learning via Synthesized Examples
We present a generative framework for generalized zeroshot learning where the training and test classes are not necessarily disjoint. Built upon a variational autoencoder based architecture, consisting of a probabilistic encoder and a probabilistic conditional decoder, our model can generate novel exemplars from seen/unseen classes, given their respective class attributes. These exemplars can s...
متن کاملImproving zero-shot learning by mitigating the hubness problem
The zero-shot paradigm exploits vector-based word representations extracted from text corpora with unsupervised methods to learn general mapping functions from other feature spaces onto word space, where the words associated to the nearest neighbours of the mapped vectors are used as their linguistic labels. We show that the neighbourhoods of the mapped elements are strongly polluted by hubs, v...
متن کاملOrdinal Zero-Shot Learning
Zero-shot learning predicts new class even if no training data is available for that class. The solution to conventional zero-shot learning usually depends on side information such as attribute or text corpora. But these side information is not easy to obtain or use. Fortunately in many classification tasks, the class labels are ordered, and therefore closely related to each other. This paper d...
متن کاملZero-Shot Kernel Learning
In this paper, we address an open problem of zero-shot learning. Its principle is based on learning a mapping that associates feature vectors extracted from i.e. images and attribute vectors that describe objects and/or scenes of interest. In turns, this allows classifying unseen object classes and/or scenes by matching feature vectors via mapping to a newly defined attribute vector describing ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Multimedia
سال: 2022
ISSN: ['1520-9210', '1941-0077']
DOI: https://doi.org/10.1109/tmm.2022.3190678